Identification of prognostic genes and gene sets for early-stage non-small cell lung cancer using bi-level selection methods

نویسندگان

  • Suyan Tian
  • Chi Wang
  • Howard H. Chang
  • Jianguo Sun
چکیده

In contrast to feature selection and gene set analysis, bi-level selection is a process of selecting not only important gene sets but also important genes within those gene sets. Depending on the order of selections, a bi-level selection method can be classified into three categories - forward selection, which first selects relevant gene sets followed by the selection of relevant individual genes; backward selection which takes the reversed order; and simultaneous selection, which performs the two tasks simultaneously usually with the aids of a penalized regression model. To test the existence of subtype-specific prognostic genes for non-small cell lung cancer (NSCLC), we had previously proposed the Cox-filter method that examines the association between patients' survival time after diagnosis with one specific gene, the disease subtypes, and their interaction terms. In this study, we further extend it to carry out forward and backward bi-level selection. Using simulations and a NSCLC application, we demonstrate that the forward selection outperforms the backward selection and other relevant algorithms in our setting. Both proposed methods are readily understandable and interpretable. Therefore, they represent useful tools for the researchers who are interested in exploring the prognostic value of gene expression data for specific subtypes or stages of a disease.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Effect of Time-dependent Prognostic Factors on Survival of Non-Small Cell Lung Cancer using Bayesian Extended Cox Model

  Abstract Background: Lung cancer is one of the most common cancers around the world. The aim of this study was to use Extended Cox Model (ECM) with Bayesian approach to survey the behavior of potential time-varying prognostic factors of Non-small cell lung cancer. Materials and Methods: Survival status of all 190 patients diagnosed with Non-Small Cell lung cancer referring to hospitals in ...

متن کامل

Prognostic value of various metabolic parameters on pre-treatment 18-F-FDG PET/CT in patients with stage I-III non-small cell lung cancer

Background: the aim of this study was to investigate the prognostic value of 18Fluorine-fluorodeoxyglucose positron emission tomography/computed tomography (18F-FDG PET/CT) parameters in both overall survival and progression-free survival in Stage I-III non-small cell lung cancer (NSCLC). Materials and Methods: In this retrospective study, 267 patients who were diagnosed as Stage I-III non-smal...

متن کامل

Classification and Biomarker Genes Selection for Cancer Gene Expression Data Using Random Forest

Background & objective: Microarray and next generation sequencing (NGS) data are the important sources to find helpful molecular patterns. Also, the great number of gene expression data increases the challenge of how to identify the biomarkers associated with cancer. The random forest (RF) is used to effectively analyze the problems of large-p and smal...

متن کامل

Survival and Prognostic Factors in Small Cell Lung Cancer Patients in Turkey

Background: Small cell lung cancer (SCLC) is a highly aggressive tumor. Objective: To evaluate the survival and time to progression of patients with SCLC admitted to a chest disease center in Istanbul, Turkey. Methods: Based on the reports of a pulmonary oncology clinic, data regarding performance status (PS), clinical stage of disease, treatment, time to progression and survival of 67 patients...

متن کامل

Hybrid Models Identified a 12-Gene Signature for Lung Cancer Prognosis and Chemoresponse Prediction

BACKGROUND Lung cancer remains the leading cause of cancer-related deaths worldwide. The recurrence rate ranges from 35-50% among early stage non-small cell lung cancer patients. To date, there is no fully-validated and clinically applied prognostic gene signature for personalized treatment. METHODOLOGY/PRINCIPAL FINDINGS From genome-wide mRNA expression profiles generated on 256 lung adenoca...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 7  شماره 

صفحات  -

تاریخ انتشار 2017